CDS

Accession Number TCMCG064C24291
gbkey CDS
Protein Id XP_020552959.1
Location join(1271995..1272099,1272755..1272853,1273125..1273187,1273856..1273933,1274353..1274539,1275051..1275316,1275386..1275749,1276264..1276338,1276922..1277199,1277286..1277342,1277659..1277970,1278052..1278230,1278339..1278621,1279425..1279649,1280118..1280279,1280577..1280720,1280827..1281045,1281578..1281682,1281769..1281864,1281978..1282142)
Gene LOC105171494
GeneID 105171494
Organism Sesamum indicum

Protein

Length 1153aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA268358
db_source XM_020697300.1
Definition protein ALWAYS EARLY 3-like [Sesamum indicum]

EGGNOG-MAPPER Annotation

COG_category BDT
Description SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
KEGG_ko ko:K21773        [VIEW IN KEGG]
EC -
KEGG_Pathway ko04218        [VIEW IN KEGG]
map04218        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGGGCCCCACGAGAAAGTCTAGAAGTGTGAATAAGCGATATTCTCAAGTGAACGAAGTATCTCCTAGCAAAGATGGAGATGGTTCTAAAAGAAGCAATAGTCGTAAAAGGAAGTTGTCTGACATGCTTGGGCCTCGATGGACCATGGAAGAGCTAACTCGTTTTTATGATTCTTACCGCAAGAATGGTAAAGATTGGAAAAAGGTGGCCAACGCTGTGAAAAATCGATCTTCAGAAATGGCAGAGGCTCTTTACACAATGAACAGGGCGTACTTATCTCTTCCACACGGAACTGCTTCTGCAGCGGGGTTGATTGCTATGATGACTGATCACTACTGCAATTTGGCAGGAACTGATAGTGACCAAGAAAGCAATGATGGAGTGGAATCATCTCAAAAGACTCAGAAGCGTGCTCGTGGCAAAGTCCAACCTCCTACCTCTAAACCATCAGCCGATCCATTTGTGCCGCATTCTCCAACCATAACCTCAAATTATGGTTGTCTGTCATTGTTAAAGAAGAAACGCTCTGGTGGAACCAGACCTCGTCCTGTTGGAAAAAGGACTCCAAGGTTCCCCGTCTCATATTCATATGAAAATATCAATGGGGAAAAATATTTTTCTCCAACAAGGCAAGGCTTGAAACTCAAGGCCAGCACTGATGACGATGAAGTAGCTCATGAAGTAGCCATAGCTTTGGCAGAGGCATCGCAGAGAGGTGGTTCTCCTCAGGTTTCTGGAACTCCCAGTAAAAGAGCTGAAAGTGTCATGTCATCACCTTTTAGGCATGCTGAAAGAAAGAATTCTGTAGCAGAAATGGTCAATGCCAAGCCCTTGGCTGCTGACACGGATGAAGAGGATTTGGAAGGAAGCACAGAAGCTGACACTGGCGAGTTGTCTGGGTATAAGCCTTGCATGACGGAATCTGCAAGTTTCCTTACAACAAGGCAGAAGGGGACTAAAGTTGAAGGGAAGAAGATTGAAGTTGATAACAATAATCAGAGTCATTTGGATAACATCAACGAAGAATGCAGTGGAACAGAGGAAGGTCAAAGGTTAGGTGCAACAAGTGGGAAATTTGATGTAGAGGTTAATAACACCAAGAATTCGAGGTCCTTCATGCAGAGTCAAAAAAAGAAGAGCAAAAAGGTTCTTTTCGGAAGAGATGAAGGCCCCGCGTTTGATGCCCTGCAAACTCTGGCTGATCTGTCATTGATGATGCCAACAGAAAATGAAGATGAATCAAGAGTGCAGTTTAAAGATGAACATGATGATCATGTCGGTGAATCTGTACCATTAGAAGCTCTGCCAGCAAACCAGCCAAGAGAAAAACGTAGATCCTCGGGCGTAAGAATGAAAGGGCATCTTGTATCAAGTTCTGAAGTTGCTCCCAGCAAAACATCAAAACCTGGAAAAAGTTCAATTTTTGATGTTAGTTCTGTTCCTGAAGAAAACCACGATTCTCATCAGCCTATCACCAAAACAACGAGAAAAAAACCAAAAATGCAAGTATCTAAGATTCAAAAATCTGAAGCTCATCCGGATATTCATCTTGGTGAATCTCTGGGGAGTGAGGTTGGAGATGCAGGGAAGAAATTAACGAGCAAGATCAAGAAATCAGCCCGAAGCAGTTCACCAAAATTGATGAAAATTTCAGAAAATTCTTCCAGTGCTGATCTACGGAAAGAAGGGAGCGATTCAGCCCAATCAGACATACAGGTTCCTGTGGTTAACCAGGTCAATTTACCTACTAAAGTTAGAAGCAGGCGTAAAATGAACCTGAAAAAACCACAGATTCAGAAAGATTTGAAATTTCCGGATAAAATCTCTGATGACCAGAGTAATCTGCCTTTTGGTTCACTCCATAACACAGCATTTAATTTAAAGGAAAAACTGTCTAATTGTTTGTTGAATCAGCGTTTGAGGAGATGGTGCACTTATGAATGGTTCTATAGTGCCATTGATTATCCATGGTTTGCAAAAAGCGAGTTTGTTGAATATTTGTATCATGTTGGATTGGGTCATGTTCCAAGATTAACTCGTGTGGAGTGGGGTGTCATAAGAAGTTCTCTTGGTAAACCACGACGATTTTCGGGGCAATTCCTGAAGGAAGAAAAGGAGAAGCTTAATCAGTATCGGGATTCCGTTAGGAAACATTACACTGAGCTCCGTGAAGGCGTAAGGGAAGGATTGCCGACCGACCTTGCAAGGCCTTTGTCAGTTGGACAGCGTGTCATTGCAATTCACCCCAAAACAAGAGAGATTCATGATGGAAGCGTGCTAACTGTTGATCACTCAAAATGTCGGGTACAATTTGACCGGCACGAACTAGGGGTCGAATTTGTCATGGATATTGACTGCATGCCCTTAAATCCTTTGGAGAATATGCCTGCTTTGCTTGGAAGACACACAGTGGCAGTTGATAAATCTTTTGAGAATTTCAATGAACTACAAATACATGGACGAGCAAAAGAGCACATCAAGCTTTCCCCTGGTGACAATCTGGATAGCATTGATGGTATTTCTCAGTTGCCTCCATTAGCTAATCCTGCCATTTTGTTGGACCAGACAAAGGTTGCTTCTGCAAATACCAATGTGCAGACAAGGATTGGACCTGCTGATGCTGCAACTTACCAGCAGATGGCATATTCTCAGCCTAGTACACTAGCTCATGTCCAGGCAAAGGAAGCTGATGTTCAAGCTCTTGCTGAGCTGACTCGTGCTCTTGATAAAAAGGAAGCTATTGTTCTTGAGTTGAGGCGCATGAATGATGATGTGTTAGAAAATCAGAAGGATGGTAATAGCTTTTTGAAAGAGTCAGAGCCATTTAAAAAGCAGTATGCTGCAGTACTTATACAATTGAATGATGCCAATGAGCAGGTTTCTTCAGCTTTACATTGCCTGAGAGAACGGAACACATATCAAGGTAAATGTCCACTTACATGGCCGGGGCCGGTGAGCAATCATGCTGATGCTGGTGGCACATTGAACTCCTCTGATCGTTCTGCATATCAAACTCAAGAATCAGGATCAAATGTGAATGAAATCATGGATAGCTCAAGAACTAAAGCCCGAAAAATGGTGGATGTAGCTATGCAGGCAATATCATCACTGAAGAGTCGGGAGGATACCATCGAGAAAATTGAGGAAGCTATTGATTATGTAAATGACCGGCTTCCCTCGGACGATTCTTGCATGCCAGTGGCATCCGATCCTAAATTAATGAATTCATCCGATATCTACACTCAAATTCCTTCAGAGCTGATTGGAAAATGCGTAGCGACTTTGCTCATGATTCAGAAGTGTACAGAAAGGCAGTTCCCTCCATCAGATATTGCAGAGATACTAGATTCTGCTGTAACAAGTTTACAGCCACACAGTTCTCAAAACCTTCCTGTTTATACAGAAATACAGAAGTGCGTGGGCATCATCAAGAACCAAATATTGGCACTAATACCGACTTAG
Protein:  
MGPTRKSRSVNKRYSQVNEVSPSKDGDGSKRSNSRKRKLSDMLGPRWTMEELTRFYDSYRKNGKDWKKVANAVKNRSSEMAEALYTMNRAYLSLPHGTASAAGLIAMMTDHYCNLAGTDSDQESNDGVESSQKTQKRARGKVQPPTSKPSADPFVPHSPTITSNYGCLSLLKKKRSGGTRPRPVGKRTPRFPVSYSYENINGEKYFSPTRQGLKLKASTDDDEVAHEVAIALAEASQRGGSPQVSGTPSKRAESVMSSPFRHAERKNSVAEMVNAKPLAADTDEEDLEGSTEADTGELSGYKPCMTESASFLTTRQKGTKVEGKKIEVDNNNQSHLDNINEECSGTEEGQRLGATSGKFDVEVNNTKNSRSFMQSQKKKSKKVLFGRDEGPAFDALQTLADLSLMMPTENEDESRVQFKDEHDDHVGESVPLEALPANQPREKRRSSGVRMKGHLVSSSEVAPSKTSKPGKSSIFDVSSVPEENHDSHQPITKTTRKKPKMQVSKIQKSEAHPDIHLGESLGSEVGDAGKKLTSKIKKSARSSSPKLMKISENSSSADLRKEGSDSAQSDIQVPVVNQVNLPTKVRSRRKMNLKKPQIQKDLKFPDKISDDQSNLPFGSLHNTAFNLKEKLSNCLLNQRLRRWCTYEWFYSAIDYPWFAKSEFVEYLYHVGLGHVPRLTRVEWGVIRSSLGKPRRFSGQFLKEEKEKLNQYRDSVRKHYTELREGVREGLPTDLARPLSVGQRVIAIHPKTREIHDGSVLTVDHSKCRVQFDRHELGVEFVMDIDCMPLNPLENMPALLGRHTVAVDKSFENFNELQIHGRAKEHIKLSPGDNLDSIDGISQLPPLANPAILLDQTKVASANTNVQTRIGPADAATYQQMAYSQPSTLAHVQAKEADVQALAELTRALDKKEAIVLELRRMNDDVLENQKDGNSFLKESEPFKKQYAAVLIQLNDANEQVSSALHCLRERNTYQGKCPLTWPGPVSNHADAGGTLNSSDRSAYQTQESGSNVNEIMDSSRTKARKMVDVAMQAISSLKSREDTIEKIEEAIDYVNDRLPSDDSCMPVASDPKLMNSSDIYTQIPSELIGKCVATLLMIQKCTERQFPPSDIAEILDSAVTSLQPHSSQNLPVYTEIQKCVGIIKNQILALIPT